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Abstract 

Background: Disseminated cancer cells (DCCs) and circulating tumor cells (CTCs) are extremely rare, but comprise the 
precursors cells of distant metastases or therapy resistant cells. The detailed molecular analysis of these cells may help to 
identify key events of cancer cell dissemination, metastatic colony formation and systemic therapy escape. 

Methodology/Principal Findings: Using the Amplil™ whole genome amplification (WGA) technology and high-resolution 
oligonucleotide aCGH microarrays we optimized conditions for the analysis of structural copy number changes. The 
protocol presented here enables reliable detection of numerical genomic alterations as small as 0.1 Mb in a single cell. 
Analysis of single cells from well-characterized cell lines and single normal cells confirmed the stringent quantitative nature 
of the amplification and hybridization protocol. Importantly, fixation and staining procedures used to detect DCCs showed 
no significant impact on the outcome of the analysis, proving the clinical usability of our method. In a proof-of-principle 
study we tracked the chromosomal changes of single DCCs over a full course of high-dose chemotherapy treatment by 
isolating and analyzing DCCs of an individual breast cancer patient at four different time points. 

Conclusions/Significance: The protocol enables detailed genome analysis of DCCs and thereby assessment of the clonal 
evolution during the natural course of the disease and under selection pressures. The results from an exemplary patient 
provide evidence that DCCs surviving selective therapeutic conditions may be recruited from a pool of genomically less 
advanced cells, which display a stable subset of specific genomic alterations. 

Citation: Czyz ZT, Hoffmann M, Schlimok G, Polzer B, Klein CA (2014) Reliable Single Cell Array CGH for Clinical Samples. PLoS ONE 9(1): e85907. doi:10.1371/ 
journal.pone.0085907 

Editor: Anthony W.I. Lo, The Chinese University of Hong Kong, Hong Kong 

Received August 30, 2013; Accepted December 7, 2013; Published January 21, 2014 

Copyright: © 201 4 Czyz et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted 
use, distribution, and reproduction in any medium, provided the original author and source are credited. 

Funding: This study was funded by The German Cancer Aid (Deutsche Krebshilfe): grant number 1 09753. The funders had no role in study design, data collection 
and analysis, decision to publish, or preparation of the manuscript. 

Competing Interests: The authors have declared that no competing interests exist. 

* E-mail: christoph.klein@klinik.uni-regensburg.de 



Introduction 

Comprehensive analysis of minute quantities of genomic DNA 
has become important in a variety of forensic, diagnostic and 
biological studies. For example, in cancer research or pre- 
implantation diagnostics, the number of available cells for 
downstream analyses may be as low as one single cell. In cancer 
research, single-cell technologies are increasingly needed to study 
the course of metastatic spread of cancer cells. Multiple studies 
conducted in the past have shown that the presence of circulating 
tumor cells (CTCs) in the peripheral blood or disseminated cancer 
cells (DCCs) in the bone marrow (BM) or lymph nodes (LN) is an 
independent prognostic factor of poor outcome of almost all tested 
cancer types [1-5]. Strikingly, it could be shown that cancer cells 
disseminate very early during the course of disease and evolve in 
parallel to the tumor cells at the primary site [6-8] . These findings 
were supported by significant genetic disparity observed between 
the primary tumors (PTs) and corresponding DCCs [9-1 1] as well 
as among DCCs themselves [12]. Subsequent functional studies 
demonstrated that, at least in the case of esophageal cancer, DCCs 
show different susceptibility to applied anti-cancer treatment than 
cancer cells originating from the primary lesion [1 1]. In line with 
this, studies in breast cancer have shown that DCCs and CTCs 
may survive the first line treatment indicating their intrinsic or 
acquired resistance to cancer therapy [13,14]. For all of these 



reasons, detailed analysis of DCCs and CTCs may help to identify 
genes and pathways allowing cancer cells to leave the primary 
lesion, survive in the circulation for extended periods of time, 
colonize distant sites, and survive systemic therapies. 

A variety of analytical techniques have been developed to 
amplify and study the genomes of single-cells [15-23]. Chromo- 
somal comparative genomic hybridization (cCGH) could be 
adapted to analyze single-cell DNA and identify highly penetrant 
alterations in the genomes of DCCs [19]. This method, although 
comprehensive, is very labor-intensive and allows only detection of 
aberrant regions larger than 10-20 Mb. Implementation of array 
CGH (aCGH) technology revolutionized the study of single-cell 
cancer genomes. A single-cell aCGH assay using tiling path BAC 
array platform described by Fiegler et al. allowed detection of a 
deletion of 8.3 Mb [24]. Using arrays composed of highly purified 
BAC clones previously we identified aberrant regions as small as 
1-2 Mb in cell lines and 4.8 Mb in DCCs [25]. More recent 
studies indicate that using high-density oligonucleotide micro- 
arrays the detection limit of single-cell aCGH can be reduced to 
1 Mb or less in freshly isolated cells [26,27]. Despite these 
advances an additional hurdle consists in the requirements 
imposed by clinical samples. So far, it has not been extensively 
studied how fixation and staining methods used to identify CTCs 
and DCCs may influence the outcome of the single-cell aCGH. 
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The objective here was to establish a robust single-cell aCGH 
protocol allowing reliable detection of genomic alterations in 
patient-derived DCCs. We applied Amplil™ single-cell WGA 
technology together with SuperPrint G3 4x180 k Agilent aCGH 
microarrays to provide a precise and easy to use workflow for high- 
resolution assessment of copy number changes in single cells. We 
show that the new workflow displays high specificity and enables 
reliable assessment of the copy number changes in single DCCs, 
which may be used to address the cellular heterogeneity in cancer. 
Finally, we demonstrate the potential of our new technique in a 
case study of DCCs isolated from a patient with advanced breast 
cancer disease during the course of high-dose chemotherapy 
treatment. 

Materials and Methods 

Ethics statement 

Bone marrow sampling was performed within the study 
protocol of the GEBDIS study at the Central Hospital in 
Augsburg after informed written consent of patients was obtained. 
The ethics committees of the University of Tubingen and of the 
University of Regensburg (ethics vote number 07-079) approved 
bone marrow sampling and genomic analysis of the isolated cells. 
Additionally, as control and reference samples, we used single cells 
from the mononuclear cell fraction of peripheral blood obtained 
from five healthy donors. Three donors provided written informed 
consent after obtaining approval by the ethics committee of the 
University of Regensburg (ethics vote number 12-101-0038) and 
two healthy donors provided verbal informed consent. The latter 
samples were taken before 2008 when no ethics vote for voluntary 
blood donations of healthy donors was required. 

Cell lines 

BT-474 and SKBR3 cell lines were obtained from repository of 
the German Collection of Microorganisms and Cell Cultures at 
the Leibniz Institute DSMZ (Braunschweig, Germany), other cell 
lines including MDA-MB-453 and MDA-MB-361 were obtained 
from the American Type Culture Collection (ATCC). OE-19 was 
obtained from the European Collection of Cell Cultures. PT1590 
cell line was originally generated at the University Medical Center 
Hamburg- Eppendorf [28] . All cell lines have been maintained in 
the conditions recommended by the distributor. Identity of all cell 
lines was confirmed using a PCR-based fingerprinting. 

Detection and isolation of disseminated cancer cells 

The procedure for bone marrow preparation has been 
described elsewhere [12,29]. Bone marrow or blood cells (1— 
2 x 1 0 6 ) were stained using either monoclonal antibody clone A45- 
B/B3 against cytokeratins 8,18,19 or monoclonal antibody clone 
CK2 against cytokeratin 18. Control cells (from healthy donors) 
were stained with monoclonal antibody specific for vimentin (clone 
V9, Dako). Mouse IgGl Kappa (MOPC-21) was used as isotype 
control for all immunocytochemical experiments. Visualization 
was carried out using alkaline phosphatase/anti-alkaline phospha- 
tase technique with 5-bromo-4-chloro-3-indolyl phosphate and 
nitroblue tetrazolium (BCIP/NBT) used as substrate. 

Primary whole genome amplification (WGA) 

All single cells, cell pools (few hundred cells) and microdissected 
specimens from formalin fixed paraffin embedded (FFPE) tumor 
tissue were amplified using the Amplil™ WGA Kit (Silicon 
Biosystems) [12,19]. 



Re-amplification of the WGA products 

Re-amplification was performed in a volume of 50 ul. Each 
PGR reaction was composed of the following ingredients: 5 ul 
Expand Long Template Buffer 1 (Roche Diagnostic), 1 uM of the 
LIB1 (5 ' -TAGTGGGATTC CTGCTGTC AGT-3 ') orMseLig-21 
primer (5'-AGTGGGATTCCTGCTGTCAGT-3') - depending 
on the adapter used in the primary WGA, 1.75 ul dNTPs 
(10 mM), 1.25 ul BSA (Roche Diagnostic), 2.5 U of Expand- 
Long-Template DNA Polymerase (Roche Diagnostic) and 1.0 |xl 
of the template DNA. The MJ thermocycler was set as follows: 1 
cycle of 94°C for 60 sec, 60°C for 30 sec, 65°C for 2 min, 10 cycles 
of 94°C for 30 sec, 60°C for 30 sec, 65°C for 2 min (extended by 
20 sec/cycle). Typically three reactions were run in parallel, which 
were pooled and used as template for DNA labeling and aCGH. A 
negative control was included in every run. 

Labeling of sample DNA 

Random-primed DNA labeling approach (RP label- 
ing). Test and reference DNA samples were labeled using 
SureTag DNA Labeling Kit (Agilent Technologies) according to 
the instruction provided by the supplier (Agilent Oligonucleotide 
Array-Based CGH for Genomic DNA Analysis, version 7.1, 
December 2011). Briefly, 1.5 - 2.0 ug of the purified input DNA 
(WGA product or unamplified genomic DNA) was supplemented 
with 5 ul of Random Primer Mix and filled up with H z O to 31 \A. 
Unamplified DNA and WGA products samples were denatured at 
95°C for 10 or 3 minutes, respectively. Sample tubes were 
transferred on ice and incubated for 5 min. The labeling reaction 
with exo-Klenow fragment consisted of the following ingredients: 
31 ul of denatured DNA, 10 ul of 5x Reaction Buffer, 5.0 ul of 
lOx dNTP Mix, 3.0 ul of Cy5-dUTP (test) or Cy3-dUTP 
(reference) and 1.0 ul of Exo(-) Klenow fragment. Labeling 
reaction was run at 37°C for two hours, followed by an 
inactivation step at 65°C for 10 minutes. Labeled DNA was 
purified using Ultra 0.5 purification system with a size cut-off of 
30 kDa. DNA yields and dye incorporation rates were quantified 
using the NanoDrop ND-1000 instrument. 

PCR-based labeling. PCR-based labeling using dye-conjugated 
universal primer (PCR-T1). Placement of the dye on the universal 
primer provides the advantage that all restriction digestion 
fragments present in the WGA product irrespectively of their size 
will be labeled with the same amount of dye. To avoid cross- 
hybridization of adapter sequences flanking amplicons in the 
WGA products, test and reference samples were labeled using 
different PCR-adapters. Test samples were labeled with the PCR- 
adapter incorporated in the Amplil™ WGA kit, while all the 
reference DNA samples were amplified using the following 
adapters: MIB5 (5 '-TGAGCTGGTCATTGCGCATGGT-3 ') 
and ddMse XI (5'-TAACCATGCGC-3'). Universal primers used 
in the labeling reaction were directly conjugated with either Cy5 in 
the case of Amplil™ universal primer [5'-TAGTGG- 
GATTCCTGCTGTCAGT-3'] or Cy3 in the cases of MIB5 
primer [5 '-TGAGCTGGTC ATTGCGCATGGT-3 '] (under- 
scores indicate the placement of the dye). The labeling PCR was 
run in a total volume of 50 ul reaction composed of 5 ul of lOx 
Expanded Long Template Buffer 1 (Roche Diagnostics), 2.4 uM 
of dye-conjugated LIB1 or MIB5 primer (for test and reference 
sample, respectively), 350 uM of dNTPs, 0.5 ul BSA (Roche), 3.75 
U of Expand-Long-Template DNA Polymerase (Roche Diagnos- 
tics) and 1.0 ul of the template DNA. The PCR was programmed 
as follows: 10 cycle of 94°C for 15 sec, 51°C for 30 sec and 65°C 
for 3:30 min, 2 cycles of 94°C for 15 sec, 51°C for 30 sec and 65°C 
for 3:30 min (extended by 10 sec every cycle), followed by a final 
elongation step of 7 min at 65 °C. Products were purified using the 
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Amicon Ultra 0.5 System (cut-off size 100 kDa) and quantified 
with the NanoDrop ND-1000 instrument. 

PCR-based labeling using incorporation of dye-conjugated dNTPs (PCR- 
T2). WGA products were labeled by PCR in the presence of Gy5 
or Cy3 conjugated dCTP and dUTP. Incorporation of the 
fluorescent dyes using the dNTPs provided the advantage that 
adapters flanking each amplicon in the WGA product could be 
removed prior to aCGH hybridization, thereby avoiding unspe- 
cific cross-hybridization of test sample with reference DNA and 
oligonucleotide probes on the array. For each sample two 50(0.1 
PCR reactions were run in parallel each comprising 5 ul of lOx 
Expanded Long Template Buffer 1 (Roche Diagnostics), 2.4 |iM 
of the universal primer (LIB 1 or MseLig-2 1 , depending on the 
adapter sequence incorporated in the WGA product), 350 uM of 
dATP and dGTP, 315 uM of dCTP and dTTP, 35 uM of Cy5/ 
Cy3-dCTP and Cy5/Cy3-dUTP (GE Healthcare), 0.5 ul of BSA 
(Roche Diagnostics), 7.5 U of Expand-Long-Template DNA 
Polymerase (Roche) and 0.5 ul of the template DNA. The PCR 
cycler was programmed as follows: 10 cycle of 94°C for 15 sec, 
60°C for 30 sec and 65°C for 3:30 min, 2 cycles of 94°C for 15 sec, 
60°C for 30 sec and 65°C for 3:30 min (extended by 10 sec every 
cycle), followed by a final elongation step of 7 min at 65°C. 
Subsequently, the resulting PCR products were subjected to 
digestion with Trull restriction endonuclease to cleave off the 
PCR-adapters. For this purpose the PCR product was supple- 
mented with 5.6ul of Buffer R and 30 U of Trull. Digestion was 
performed at 65°C for 3 hours. Resulting products were pooled 
and purified using the Amicon Ultra 0.5 System (100 kDa cut-off). 
DNA yields and dye incorporation rates were quantified using the 
NanoDrop ND-1000 Instrument. 

Array comparative genomic hybridization 

Array CGH was performed on oligonucleotide-based SurePrint 
G3 Human CGH 4xl80K microarray slides (design code: 
022060) according the protocol provided by the manufacturer 
(Agilent Oligonucleotide Array-Based CGH for Genomic DNA 
Analysis, version 7.1, December 2011). Slight modifications were 
introduced WGA products processed with the PCR-based labeling 
approaches. Here, the hybridization mix consisted of 5.0 Ug of 
Cot 1 -DNA (Roche Diagnostics), 12 ul of lOx Blocking Reagent 
(Agilent Technologies), 60 ul of 2x Hi RPM Hybridization Buffer, 
1 % (v/v) of both Tween20 and Igepal and 1 9 ul of both test and 
reference DNA. For each hybridization 100 ul of the hybridization 
mix was applied on the array and hybridized at 65°C for 24 h. 
Following the hybridization, the slides were washed twice for 2:30 
min in Oligo aCGH/ChlP-on-Chip Wash Buffer 1 (Agilent 
Technologies) at room temperature, twice for 30 sec min in Oligo 
aCGH/ChlP-on-Chip Wash Buffer 2 (Agilent Technologies) at 
37°C. Washed slides were immersed in acetonitrile to remove all 
remaining traces of the wash buffers. Finally, slides were scanned 
using an Agilent Microarray Scanner Type C. 

Processing and analysis of the aCGH data 

Microarray TIFF image files were processed with the Agilent 
Genomic Feature Extraction Software (version 10.7). The resulting 
text files were imported and analyzed with Agilent Genomic 
Workbench Software (version 6.5 Lite). Aberrant regions were 
recognized using ADM-2 algorithm with threshold set to 6.5. 
Centralization algorithm was set to a threshold of 6.0 and bin size 
of 10. To avoid false positive calls, aberration filters were applied 
to define the minimum log2 ratio (0.25 for unamplified DNA or 
WGA products from freshly picked cell and 0.3 for WGA product 
from cell stained for vimentin or cytokeratins) and the minimum 
number of probes in an aberrant interval (5 and 25 for unamplified 



DNA and WGA products, respectively). The microarray data 
presented in this manuscript has been deposited at Gene 
Expression Omnibus (GEO) database (accession number: 
GSE52366). 

Identification of minimal regions of aberration (MRAs) 

MRA refers to the smallest region of genomic copy number 
alteration overlapping across an analyzed set of aCGH profiles. 

Statistical analysis 

Identification accuracy was assessed using separate ROC curves 
for gains and losses. To this end segmentation profiles obtained by 
the ADM2 algorithm were binarized according to threshold values 
(Thr): for gains, values £ Thr were set to unity and 0 otherwise, 
for losses: values ^ —Thr were set to unity and 0 otherwise. 
Reference profiles were binarized according to a fixed threshold of 
Thr = 0.25, while test profiles were binarized for different values 
of Thr. ROC curves were obtained by comparing a reference 
binarization to test binarizations for different positive threshold 
values. For clustering, segmentation profiles were discretized 
according three levels, i.e. values & Thr were labeled as gains by 1, 
values £ -Thr were labeled as losses by -1, and all other values 
were labeled by zero. Hierarchical clustering was performed using 
Euclidean distance and agglomeration by complete linkage. 
Analyses were done using R [R: A Language and Environment 
for Statistical Computing, R Core Team,R Foundation for 
Statistical Computing, Vienna, Austria, 2013, url = http:// 
www.R-project.org] . 

Results 

Figure 1 provides an overview of the sample processing and the 
experimental set-up used to obtain optimized assessment of single 
cell copy number changes. Based on our experience with single 
cell cCGH we focused on those labeling methods that are 
compatible with the WGA procedure and may have an impact on 
the aberration calls. 

Optimization and validation of the single-cell aCGH assay 

First, we developed two customized PCR-based approaches for 
labeling Amplil™ single-cell WGA products and compared them 
with the recently published protocol using random-primed 
isothermal incorporation of fluorescent dyes (RP labeling) [27]. 
The first PCR-based protocol employed a dye-conjugated 
universal primer (PCR-T1) that binds to the adapter sequence 
utilized in the primary WGA. The second approach was based on 
the incorporation of dye-conjugated dNTPs (PCR-T2) during a 
short re-amplification step. Both techniques utilized the same 
universal primer as incorporated in the Amplil™ WGA proce- 
dure, thereby enabling comprehensive labeling of the complete 
representation of single-cell DNA. Both labeling approaches were 
tested for their accuracy and reliability using PT1590 esophageal 
cancer cell line cells [28]. We generated three single-cell and two 
WGA products of cells pooled from PT1590 cell line. Paired 
samples (single cell and cell pool) for both PT1590 cell line and 
healthy donor were labeled in parallel with both techniques and 
subsequently hybridized on Agilent SurePrint G3 Human 4x 180k 
arrays. As benchmark, we used aCGH profiles obtained with 
unamplified genomic DNA. Samples processed with PCR-T1 
provided consistendy higher dye incorporation rates, whereas 
PCR-T2 generated higher DNA yields (Table SI in the Text SI). 
More importantly, signal intensities and signal-to-noise ratios 
obtained for both dyes (Cy5-red/test and Cy3-green/reference), 
were higher for samples processed with PCR-T2. The direct 
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Figure 1. Experimental approach. Overview of the single cell aCGH procedure: single cells were isolated by micromanipulation and subjected to 
Ampin™ WGA protocol. Primary WGA product can be re-amplified for further downstream applications (optional step). DNA was labeled by RP 
labeling or PCR-based approaches (PCR-T1 and PCR-T2). Subsequent hybridization was carried out on Agilent SurePrint G3 Human 4x1 80k arrays 
resulting data was evaluated with Agilent Genomic Workbench Software. 
doi:1 0.1 371 /journal.pone.0085907.g001 



comparison of PCR-T1 and PCR-T2 revealed striking disparities 
between single-cell aCGH profiles of the same individual cell 
(Figure S1A-C). When compared to bulk DNA we found that 
multiple genomic alterations could be detected only if labeling 
PCR-T2 was applied (Figure S1A-C). The better fit of PCR-T2 
aberrations with results from bulk genomic DNA was quantified 
by subsequent ROC-analysis (Figure SID). In addition, repeated 
hybridization of single-cell WGA products processed with PCR- 
T2 gave highly reproducible results (Figure S2). 

To further improve the performance of the assay we searched 
for the best performing reference sample. To this end we 
compared two sample types: (i) WGA products generated from 
cell pools and (ii) samples obtained by pooling four single-cell 
WGA products. Direct comparison revealed that derivative log2 
ratio spread (DLRS) values, a measurement of hybridization noise, 
were consistently higher when WGA products from cell pools were 
used as reference (Figure S2). As a consequence all subsequent 
experiments included a pool of four single cell WGA products 
originating from one healthy individual as a reference sample. 

Next we compared the performance of PCR-T2 with the 
recently published RP labeling procedure [27]. We started with 
using freshly picked single cells of OE-19 esophageal cancer cells. 
Irrespectively of the labeling approach used, two aberrations of 
only 0.1 Mb in size (one homozygous deletion and one amplifi- 
cation) could be reliably and reproducibly detected in all OE-19 
single cells (Figure 2A-B) and the aCGH profiles of single OE-19 
cells were highly concordant with the results of unamplified DNA 
(Figure 2C-D). In addition, we tested both labeling techniques 
when applied on re-amplified WGA products of OE-19 cells. 
Independently of the labeling technique applied, aCGH profiles of 
both re-amplified and primary single-cell WGA products as well as 
with unamplified DNA showed high level of concordance (Figure 
S3A-B). This was also true for single cell WGA products obtained 
from healthy donors displaying balanced profiles (Figure S3C). 
Therefore, the second round of amplification did not introduce a 
significant amplification bias. Since fixation and staining protocols 
used for detection of DCCs may affect the quality of single cell 
DNA [30,31], we also compared PCR-T2 and the RP labeling for 
clinical samples. Here, we could show that when applied to 
immunostained cells PCR-T2 is more robust and less susceptible 
to generate technical bias than it is the case with RP labeling. 
Therefore, PCR-T2 became the method of choice for processing 
the Amplil ™ WGA product in all subsequent experiments (Figure 
S6 and Figure S7; text SI). 

Detection of cell-to-cell heterogeneity in single cells 

We evaluated the capability of our assay based on PCR-T2 
labeling to assess genomic heterogeneity at the single cell level. W e 
analyzed three single-cell WGA products, cell-pool WGA product 



and corresponding unamplified DNA of PT-1590 cell line. PT- 
1590 cells were previously shown to harbor amplification of 
ERBB2 gene [11]. Surprisingly, we found by aCGH analysis that 
one out of three analyzed PT1590 cells did not display the 
amplification of ERBB2 (Figure 3, Figure S4). To determine 
whether this finding was a technical artifact or reflecting true 
cellular heterogeneity, we performed FISH analysis. We found 
that about 10% of the PT1590 cells indeed displayed a balanced 
copy number ratio at the ERBB2 locus (Figure 3B). GNAs found 
in single cells only may thus represent true-positive events that 
escape detection in the bulk DNA where they are masked by the 
heterogeneity of the cell population. (Figure 3B, Figure S4). 

Quantitative assessment of CNAs in single cells 

To further challenge our method, we asked whether we could 
correctly quantify copy number changes, i.e. assess the level of 
gene amplification in single cells. For this, we analyzed four 
different breast cancer cell lines (MDA-MB-453, MDA-MB-361, 
SK-BR-3 and BT-474) each representing different copy number of 
the ERBB2 locus (FISH ERBB2 to CEP1 7 ratios equal 2.15, 3.60, 
4.42, 5.79, respectively). For each cell line both unamplified DNA 
and single-cell WGA products were analyzed by aCGH giving 
highly concordant profiles (Figure 4A). The average log2 ratios of 
probes representing the ERBB2 locus in single-cell aCGH 
experiments closely correlated with corresponding log2 values 
obtained for unamplified DNA and FISH ratios of ERBB2 
(Pearson's correlation coefficient 0.94 and 0.97, respectively, 
Figure 4 B-C). 

Application to single DCCs 

To test the assay on archival patient samples, we analyzed 
DCCs and corresponding metastatic tissue of an individual breast 
cancer patient. At the time of primary tumor resection (July 1 998) 
this patient was diagnosed with stage IV breast cancer (PT4b, 
Nlbii, pMl, G3, ER+, PR+, HER2-) with metastases detected in 
bones and lymph nodes and was subsequently treated with high 
dose chemotherapy (Figure 5A). During the course of treatment 
bone marrow was aspirated four times and screened for the 
presence of DCCs (Figure 5B). The DCC count was steadily 
decreasing throughout the treatment (Figure 5B), however it never 
reached zero. From each time point we analyzed two DCCs using 
our aCGH method. Additionally, DNA was collected from the 
primary tumor and a lymph node metastasis, amplified with 
Amplil™ WGA and subjected to aCGH. Subsequent aCGH 
analysis revealed multiple highly penetrant alterations among 
DCCs and corresponding tumor tissue samples (Figure S5). 
Systematic analysis of CNAs revealed 43 minimal regions of copy 
number alteration (MRAs), nine of which were shared by samples 
from the primary lesion, the lymph node metastasis and at least 
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Figure 2. PCR-based PCR-T2 labeling technique vs. RP labeling. A) Chromosome specific aCGH profiles of chromosome 8. B) Chromosome 
specific aCGH profiles of 17. Each panel represents aCGH profiles generated with unamplified and single-cell gDNA (PCR-T2 labeling) - left and 
middle plot, respectively. The right plot of each panel represents magnified graphical overview of genes within loci recognized as aberrant. C) ROC- 
curves (corresponding to profiles presented in panel A) depicting the accuracy of single cell aCGH assay for PCR-T2 or RP labeling. The array CGH 
profile generated using unamplified gDNA of OE-19 cells was taken as reference for the comparison. ROC analysis was performed on a genome-wide 
basis. D) Genome wide aCGH profiles of OE-19 cells generated using unamplified gDNA (upper panel) and a single-cell WGA product labeled with 
PCR-T2 (middle panel) or RP labeling approach. 
doi:1 0.1 371 /journal.pone.0085907.g002 
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Figure 3. Assessment of the cellular heterogeneity. A) Vertical aCGH profiles of chromosome 17 for unamplified gDNA and single-cell WGA 
products of PT1590 cells. Red arrow indicate ERBB2 locus. Note, one single cell (cell 3) shows a balanced profile at this site. Black arrows indicate 
private genetic alteration detected only in individual cells. B) Representative FISH images of PT1590 cells. Red signals indicate ERBB2 locus and green 
CEP17. White arrows label cells with balanced copy number of ERBB2 vs. CEP17. Yellow arrowhead shows cells with high-level amplification of the 
ERBB2 locus. 

doi:1 0.1 371 /journal.pone.0085907.g003 



PLOS ONE | www.plosone.org 



5 



January 2014 | Volume 9 | Issue 1 | e85907 



Single-Cell Array CGH for Clinical Samples 




Figure 4. Quantitative assessment of copy number changes in tumor cells by single cell aCGH. A) Vertical aCGH profiles of chromosome 
17 of four breast cancer cell lines with increasing copy number of ERBB2 locus (MDA-MB-453, MDA-MB-361, SKBR3 and BT474) generated using 
unamplified DNA (upper row) or single-cell WGA products (lower row). Red brackets indicate the position of the ERBB2 locus. Corresponding FISH 
ratios (ERBB2 vs. CEP17) of all cell lines are indicated in blue brackets. B) Correlation of average log2 values of ERBB2 specific probes obtained in 
single-cell aCGH experiments (Y-axis) vs. corresponding values obtained with unamplified DNA (X-axis). DNA samples from four breast cancer cell 
lines (MDA-MB-453, MDA-MB-361, SKBR3, BT474) have been included in the analysis. Pearson's correlation coefficient 0.94. C) Correlation of average 
log2 values specific for ERBB2 locus obtained in single-cell aCGH experiments (Y-axis) vs. FISH ratios (ERBB2/CEP17) calculated for four breast cancer 
cell lines: MDA-MB-453, MDA-MB-361, SKBR3, BT474. Pearson's correlation coefficient 0.97. 
doi:1 0.1 371 /journal.pone.0085907.g004 



one of the DCCs (Figure 5C, Table S2 in the Text SI). Among the 
shared alterations, we identified five core MRAs shared by all 
samples (Figure 5E) - gains on 17q and 8q and losses on 
chromosomes 6, 12 and 13 - harboring multiple oncogenes and 
tumor suppressor genes, i.e. MYC, MTDH, RBI and CDKN2A, 
that were previously associated with the progression of breast 
cancer [32,33]. Additional three MRAs were shared between the 
metastatic tissue and all DCCs but not with the PT (Figure 5E). 
Hierarchical clustering revealed that 7 out of 8 DCCs were 



genetically different from the primary tumor and displayed more 
similarities to the lymph node metastasis (Figure 5D). The detailed 
inspection revealed substantial heterogeneity for various CNAs in 
the genomes of DCCs (Figure S5). One of the DCCs collected at 
the time point of the third bone marrow sampling (DCC $1, time 
point 3) lacked multiple aberrations shared by the other sister cells 
(Figure S5). This cell showed the highest genomic similarity to the 
primary tumor (Figures 5D), however it harbored fewer genomic 
alterations than the primary lesion (Table S3 in the Text SI). 
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Hierarchical clustering confirmed the genomic divergence of cell 
DCC $1 from time point 3 in comparison with the remaining 
DCCs and the metastatic tissue. Lack of otherwise recurrent CNAs 
was also observed in DCC ^2 from time point 4. In this case, 
however, the cell harbored additional unique copy number 
changes (both gains and losses), indicating the existence of 
subclones within the DCC population (Figure S5). 

Discussion 

In this study, we established and validated a novel aCGH 
protocol for single cells enabling detection of chromosomal 
changes in single DCCs. This could be achieved by combined 
use of Amplil ™ WGA technology together with customized DNA 
labeling technique and high-resolution Agilent SurePrint G3 
Human 4x180 k oligonucleotide microarray platform. Data 
published by us and others indicate that Amplil technology is 



well suited for aCGH based analysis of DNA from single cells 
[25,27] as well as DNA extracted form FFPE tissue [34]. Here, we 
show a new workflow for single-cell aCGH which, when applied 
on patient-derived clinical samples, appears to generate fewer 
artificial gains and losses than a recently published protocol [27]. 

First, we developed two novel PCR-based labeling approaches 
designed specifically to match the Amplil WGA products and 
compared it with recendy published RP based labeling [27]. 
Direct comparison of the RP labeling with PCR-T2 technique 
provided highly comparable results in terms of sensitivity and 
reliability of both protocols. Importandy, the labeling techniques 
did not introduce any noticeable bias into the original WGA 
representation. In contrast to RP labeling, however, utilization of 
PCR-T2 technique requires only minute amount of DNA, leaving 
sufficient amount of the original WGA product for other high- 
throughput downstream application i.e. next generation sequenc- 
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Figure 5. Molecular findings in individual DCCs over the course of systemic treatment. A) Chemotherapy regime: first patient was 
subjected to three cycles of 75 mg/m 2 Taxotere® (T) and 50 mg/m 2 Doxorubicin (D) in three week intervals. Subsequently followed two cycles of 
high dose chemotherapy treatment with an intermediate interval of 4-6 weeks. In the first cycle 500 mg/m 2 of Vepesid (V), 4000 mg/m 2 of Isofamid 
(I) and 500 mg/m 2 of Carboplatin (C) was administered. The last cycle consisted of 1500 mg/m2 of Cyclophosphamid (Cy) and 200 mg/m2 of 
Thiotepa (T). Both cycles of high dose chemotherapeutic treatment were accompanied by addition of granulocyte colony-stimulating factor (G-CSF), 
and autologous transplant of peripheral blood stem cells (PBSCs). B) The course and outcome of bone marrow sampling. DCC count indicated the 
number of identified DCCs in 1.0 x10 6 mononuclear cells. C) Venn diagram depicting distribution of MRAs (gains and losses) across three types of 
clinical samples. D) Hierarchical clustering (distance: Euclidian; linkage: average) of samples including DCCs, the primary tumor and a metastatic 
lesion. E) Table depicting core MRAs that were found in all DCCs, the primary tumor and the metastatic lesion or in the metastatic compartments 
(DCCs and the lymph metastasis) only. 
doi:1 0.1 371 /journal.pone.0085907.g005 
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ing (NGS). Moreover, the PCR-T2 procedure can be applied 
directly on the WGA products, while RP labeling requires a prior 
purification step. Hence, considering these practical advantages, 
we favor the PCR-T2 technique for processing Amplil 1M WGA 
products. 

Second, we sought to find the most suitable reference for single- 
cell aCGH analysis. Previous reports indicated that different 
biological nature of test and reference samples, i.e. amplified 
single-cell test and unamplified DNA, may influence both the 
sensitivity and the specificity of the single-cell aCGH assays [35— 
37]. These effects can be minimized by application of dedicated 
normalization algorithms [36,38] or matched test and reference 
samples representing the same sample type - i.e. WGA products 
[27,37]. Following this rationale, we determined that multiple 
pooled single-cell WGA products represent the optimal reference 
for single-cell aCGH experiments. This is in line with previous 
findings published by Bi and colleagues [26]. 

The third major improvement could be achieved by usage of 
high-resolution aCGH arrays based on oligonucleotides. Studies 
published in the recent years indicated applicability of the new 
generation of high-resolution SNP and CGH arrays for single-cell 
CGH analysis [26,35,39-43]. In comparison to the previously 
used BAC-based arrays [24,25,44] these technologies offer lower 
qualitative variability of the array slide manufacturing process, 
customizable microarray designs and very low probe spacing. 
Utilization of SurePrint G3 Agilent 4x180 k arrays (median probe 
spacing of 1 3 kb) enabled us to obtain high-quality aCGH results 
of single cells reaching precision comparable to the currently most 
sensitive single-cell aCGH protocol [27]. Despite that previous 
studies indicated that the detection limit of the single-cell aCGH 
may depend on the density of microarray probes [26,39], the 
reproducible detection of alterations as small as 0. 1 Mb suggests 
that 4x180 k arrays provide sufficient resolution. 

Importantly, in our single-cell experiments we have not 
observed systemic fluctuations of log2 values of the microarray 
probes indicating a very homogeneous and unbiased representa- 
tion of single-cell genomes. Therefore, in contrast to other single- 
cell aCGH approaches our method does not require custom 
design of copy number detection algorithms [36,39] nor multiple 
hybridization runs to reliably detect copy number changes at the 
single-cell level [36]. Despite the increased levels of hybridization 
noise observed when processing single-cell DNA, standard 
aberrations recognition algorithms - ADM-2 and CBS [45] - 
provided high quality result. This is true even if old archival DNA 
(i.e. samples extracted from formalin fixed tissue) or re-amplified 
single-cell WGA products were used as a template for the analysis, 
showing high feasibility of the assay. 

The resulting workflow proved to be remarkably sensitive 
allowing quantitative assessment of CNAs in single cells as 
demonstrated by the high correlation of signal intensity (repre- 
sented by log2 ratios of the inspected locus) to copy numbers 
(measured as FISH value) when using single cell DNA. Moreover, 
our assay shows the ability to uncover cellular heterogeneity. 

To date little is known about the level of heterogeneity of DCCs 
and their relation to the primary tumor and the metastatic lesion. 
Analysis of DCCs performed either directiy or upon short-term 
culture provided already evidence for genomic variability of this 
cell population [12,46,47]. This may have clinical implications as 
variant cells may be subjected to natural selection [8]. Our results 
indicate that specific clones within the DCC population may be 
selected by high-dose chemotherapy. These clones (here repre- 
sented by DCC $d from time point 3 and DCC #2 from time 
point 4) are characterized by the presence of highly penetrant 
MRAs shared by PT, DCCs and metastasis, which either indicate 



close affinity to the founder cell of the tumor or suggest that such 
aberrations are essential for tumor formation at autochthonous 
and distant sites. This pattern of clonal selection was previously 
detected during the progression from myelodysplastic syndromes 
to secondary AML [48,49]. In this model a minor subclone, 
harboring essential alterations, survives the selective sweep, gains 
additional mutations which enable its outgrowth leading to the 
clinical relapse [48,49]. Our data indicate that this process may 
also be operative in breast cancer DCCs with gains on 1 7q and 8q, 
as well as losses on chromosomes 6, 12 and 13 representing stable 
genomic alterations that may constitute a sine qua non of tumor 
mass formation. Whether additional aberrations of key clones 
selected after chemotherapeutic treatment (DCC #1 from time 
point 3 and DCC #2 from time point 4) contribute to 
chemotherapy resistance awaits further studies. In summary, our 
analysis shows that DCCs in metastatic patients acquire hetero- 
geneous aberrations on the basis of a stabilized aberrant genome 
with core changes. Such cells form dynamic cell populations from 
which different clones may be selected throughout the course of 
disease. 

In conclusion, our new single-cell aCGH protocol enables 
reliable detection of CNAs in single cells. Further improvement of 
the achieved resolution may be possible by NGS technologies. 
Assisted by NGS-based targeted mutation analysis our assay will 
provide a precise but still affordable tool for monitoring the 
molecular evolution of early and advanced systemic cancer. 

Supporting Information 

Figure SI Selection of best performing PCR-based DNA 
labeling technique. A) Horizontal aCGH profiles of unampli- 
fied gDNA from PT-1590 cell (upper panel) and corresponding 
single-cell WGA product processed with PCR-T1 (middle panel) 
or PCR-T2 (lower panel). Red arrows indicate genomic intervals, 
which were called as aberrant in unamplified bulk gDNA but not 
in the single-cell WGA products. Green arrows indicate aberrant 
intervals detected exclusively in the single cell. B) Vertical aCGH 
profiles specific for chromosome 1 (placed in red brackets in the 
panel A) generated with unamplified gDNA (left panel) and single- 
cell WGA product processed with either PCR-T1 (middle panel) 
PCR-T2 (right panel). Arrows indicate genomic intervals that were 
falsely recognized as balanced when PCR-T1 labeling technique 
was used. C) Vertical aCGH profiles specific for chromosome 8 
(placed in green brackets in the panel A) generated with 
unamplified gDNA (left panel) and single-cell WGA product 
processed with either PCR-T1 (middle panel) or PCR-T2 (right 
panel). Arrows indicate genomic intervals that were falsely 
recognized as balanced when PCR-T1 labeling technique was 
utilized. D) ROC-curves for single-cell aCGH comparing PCR-T1 
vs. PCR-T2. Array CGH profile generated using unamplified 
gDNA of PT-1590 cells was taken as reference for the comparison. 
ROC analysis was performed on a genome-wide basis. A larger 
area under the curve (AUC) indicates higher accuracy of the 
method. 
(TIF) 

Figure S2 Reproducibility of the single-cell aCGH 

assay.A) Use of DNA from a cell pool as reference. B) Use of 
DNA from pooled single cells as reference. In both experiments 
horizontal aCGH profiles and corresponding correlation for the 
assessment of CNAs are shown. Experiments were performed on 
the same single-cell WGA product - PT-1590 single cell 3. 
Pearson's correlation coefficient (r ) was used to assess the 
reproducibility of the technical replicates. 
(TIF) 
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Figure S3 Array CGH using re-amplified single-cell 
WGA products. A) Horizontal genome wide aCGH profiles of 
OE- 1 9 cells generated using unamplified gDNA (upper panel) and 
a re-amplified single-cell WGA product labeled with PGR-T2 
technique (middle panel) or RP labeling approach (lower panel). B) 
ROC-curves depicting the accuracy of the single-cell aCGH 
protocol when performed on re-amplified single-cell Amplil 
WGA products generated using OE-19 cells. Genomic profiles of 
unamplified gDNA of OE-19 cells was used for the comparison. 
ROC analysis was performed on a genome-wide basis. A larger 
area under the curve (AUC) indicates higher accuracy of the 
method. C) Horizontal genome wide aCGH profile of a single cell 
of a female donor hybridized against a male reference DNA (sex 
mismatch experiment). 
(TIF) 

Figure S4 aCGH analysis of PT1590 esophageal cancer 
cell line cells. Horizontal profiles of unamplified gDNA and 
single-cell WGA of PT 1590 cells (depicted also in the Figure 3). 
Note the differences between the profiles of individual cells. 
(TIF) 

Figure S5 aCGH profiles of clinical samples of a 
metastatic breast cancer patient. Horizontal aCGH plots 
indicating the genomic gains and losses detected in eight DCCs 
and corresponding tumor tissue samples (primary tumor and 
lymph node metastases) of a patient with advanced breast cancer. 
Red arrows indicate genomic loci that remained balance in two 
selected cells, despite positively selected in the remaining samples. 
Blue arrows indicated genomic alterations occurring exclusively in 
DCC #2 from the time point 2. 
(TIF) 

Figure S6 Comparison of PCR-T2 and RP-labeling 
approaches on immunostained cells from healthy do- 
nors. Horizontal aCGH profiles of immunostained single-cell 
WGA products generated from white blood cells (WBCs) of three 
healthy individuals. Panels A, C, E depict single-cell samples 
processed with the RP-labeling technique, whereas panels B, D, F 
depict the same WGA samples processed with the PCR-T2 
method. Aberrations were called using two aberration filters: (i) 
minimum number of probes in the region = 10 and minimum 
absolute average log2 ration for a region = 0.3 (upper panels); (ii) 
minimum number of probes in the region = 25 and minimum 
absolute average log2 ration for a region = 0.3 (lower panels). 
Black asterisks indicate genomic loci of randomly distributed false 
positive aberration calls. Green asterisks show locations of artifacts 
detected exclusively upon utilization of the RP-labeling technique. 
(TIF) 

Figure S7 Comparison of PCR-T2 and RP-labeling 
approaches on clinical DCCs. A, C) Horizontal aCGH 
profiles of immunostained single-cell WGA products generated 
using DCC of two prostate cancer patients. All single-cell were 
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characteristics resulting from the application of two PCR-based 
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